Three Statistical Summarizers at CLEF-INEX 2013 Tweet Contextualization Track

Author

  • Juan-Manuel Torres-Moreno
Abstract

According to the organizers, the objective of the 2014 CLEF-INEX Tweet Contextualization Task is: “...The Tweet Contextualization aims at providing automatically information a summary that explains the tweet. This requires combining multiple types of processing from information retrieval to multi-document summarization including entity linking.” We present three statistical summarization systems applied to the CLEF-INEX 2014 task. The Cortex summarizer uses several sentence selection metrics and an optimal decision module to score sentences from a source document. The Artex summarizer uses a simple inner product between the topic vector and the pseudo-word vector. The Reg summarizer is a performant graph-based summarizer. The results show that our systems performed well on the CLEF-INEX task. Our three systems obtained the first rank in the INEX manual evaluation.
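
The abstract describes Artex only as an inner product between a topic vector and a pseudo-word vector. As a rough illustration of inner-product sentence scoring in general, the sketch below builds a term-frequency vector per sentence, averages them into a "topic" vector, and ranks sentences by their dot product with it. This is a minimal sketch under stated assumptions, not the author's implementation: the tokenizer, the averaging scheme, and the function names `tokenize`, `score_sentences` and `summarize` are all hypothetical, and the pseudo-word vector mentioned in the abstract is not modeled.

```python
from collections import Counter

PUNCT = ".,;:!?\"'()"


def tokenize(sentence):
    """Lowercase, split on whitespace, strip surrounding punctuation (assumed preprocessing)."""
    words = (w.strip(PUNCT).lower() for w in sentence.split())
    return [w for w in words if w]


def score_sentences(sentences):
    """Score each sentence by the inner product of its term-frequency
    vector with the average ("topic") vector of all sentences."""
    if not sentences:
        return []
    vocab = sorted({w for s in sentences for w in tokenize(s)})
    index = {w: i for i, w in enumerate(vocab)}

    # One term-frequency vector per sentence.
    vectors = []
    for s in sentences:
        v = [0.0] * len(vocab)
        for word, count in Counter(tokenize(s)).items():
            v[index[word]] = float(count)
        vectors.append(v)

    # Assumed topic vector: the mean of the sentence vectors.
    n = len(vectors)
    topic = [sum(v[i] for v in vectors) / n for i in range(len(vocab))]

    return [sum(a * b for a, b in zip(v, topic)) for v in vectors]


def summarize(sentences, k=1):
    """Return the k highest-scoring sentences, kept in document order."""
    scores = score_sentences(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(sentences, k=2)` on a list of candidate sentences returns the two sentences that share the most vocabulary with the collection as a whole, in their original order.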

Similar resources

Two Statistical Summarizers at INEX 2012 Tweet Contextualization Track

According to the organizers, the objective of the 2012 INEX Tweet Contextualization Task is: “...given a tweet, the system must provide some context about the subject of the tweet, in order to help the reader to understand it. This context should take the form of a readable (and short) summary, composed of passages from [...] Wikipedia.” We present summarizers Cortex and KL-summ applied to the ...

Ultra-stemming and Statistical Summarization at INEX 2013 Tweet Contextualization Track

According to the organizers, the objective of the 2013 INEX Tweet Contextualization Task is: “...The Tweet Contextualization aims at providing automatically information a summary that explains the tweet. This requires combining multiple types of processing from information retrieval to multi-document summarization including entity linking.” We present the Cortex summarizer applied to the INEX 2...

An Automatic Greedy Summarization System at INEX 2013 Tweet Contextualization Track

According to the organizers, the aim of the 2013 INEX Tweet Contextualization Track is: “...given a tweet, the system must provide some context about the subject of the tweet, in order to help the reader to understand it. This context should take the form of a readable (and short) summary, composed of passages from [...] Wikipedia.” We present an automatic greedy summarizer named REG applied to...

A Method for Short Message Contextualization: Experiments at CLEF/INEX

This paper presents the approach we developed for automatic multi-document summarization applied to short message contextualization, in particular to tweet contextualization. The proposed method is based on named entity recognition, part-of-speech weighting and sentence quality measuring. In contrast to previous research, we introduced an algorithm for smoothing from the local context. Our app...

Testing a Statistical Word Stemmer based on Affixality Measurements in INEX 2012 Tweet Contextualization Track

This paper presents an experiment of statistical word stemming based on affixality measurements. These measurements quantify three characteristics of language. In this experiment we tested one strategy of stemming with three different sizes of training data. The developed stemmer was used by the automatic summarization system Cortex to preprocess input texts and produce readable summaries. All sum...

Publication date: 2014